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METHOD AND SYSTEM FOR THE AUTOMATIC COLLECTION 
AND CONDITIONING OF CLOSED CAPTION TEXT ORIGINATING FROM 
MULTIPLE GEOGRAPHIC LOCATIONS, 
AND RESULTING DATABASES PRODUCED THEREBY 

5 

RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Patent Application Serial No. 
09/332,723 filed June 14, 1999, now U.S. Patent No. 6,266,094 granted July 24, 2001. 
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BACKGROUND OF THE INVENTION 

M 1. Field of the Invention. 

| a i 

15 The present invention relates to methods and systems for monitoring, decoding, 

sj transmitting, and archiving of closed caption texts from television broadcasts. More particularly, 

yj the present invention relates to methods and systems for the automatic collection and 

Q 

conditioning of closed caption texts originating from multiple geographic locations, and resulting 
databases produced thereby. 

20 

2. Description of the Prior Art 

In the United States, television stations currently create more than 12,000 hours of local 
news programming every week. Network and cable news organizations broadcast an additional 
25 1,400+ hours. Because every newscast contains references to specific persons, organizations, 
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and events, an entire industry has grown up to monitor newscast content on behalf of 
newsmakers. The traditional monitoring approach required workers to videotape, view, and 
summarize the content of TV newscasts. However, using such a traditional method, it is very 
difficult to monitor every newscast on every channel on a timely basis. Thus, a need exists for 
5 newsmakers and other interested parties to have comprehensive and cost effective real time 
access to a database of newscast content. 

Closed captioning, which is mandated by the Federal Government for most television 
programs, is a textual representation of the audio portion of a television program. Originally 


1@D devised as a means for making program dialogue accessible to the deaf and hearing impaired, 
P 

: s 

closed captioning is often displayed now for the convenience of non-deaf persons in 

2 environments where television audio is not practical, such as noisy restaurants and airport kiosks. 

SSS. 

3 Closed captioning is encoded into the video blanking intervals (VBI), which are part of the video 
M component of a conventional television signal. In the United States, line 21 of the VBI is 


One approach to monitoring television broadcasts by using closed caption text is 
disclosed in U.S. Patent No. 5,481,296, issued January 2, 1996, to Cragun et al., and titled 
APPARATUS AND METHOD FOR SELECTIVELY VIEWING VIDEO INFORMATION. 
20 The Cragun et al. system provides a closed caption decoder that extracts the closed caption text 
from a television broadcast. A viewer specifies one or more keywords to be used as search 
parameters and a digital processor executing a control program scans the closed caption text for 
words or phrases matching the search parameters. The corresponding complete video recording 


n 


Iff reserved for carrying closed captioning. 
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bf the television broadcast may then be displayed, edited, or saved. In one mode of operation, 
the Cragun et al. system may be used to scan one or more television channels unattended and 
save items that may be of interest to the viewer. In another mode of operation, the Cragun et al. 
system may be used to assist in quickly locating previously stored video recordings. One clear 
5 disadvantage of the Cragun et al. system is that extremely large amounts of memory are required 
to store the video segments. 

One approach to monitoring television broadcasts by using closed caption text is 
disclosed in U.S. Patent No. 5,809,471 issued September 15,1998 to Brodsky et al and titled 
1<B RETRIEVAL OF ADDITIONAL INFORMATION NOT FOUND IN INTERACTIVE TV OR 

TELEPHONY SIGNAL BY APPLICATION USING DYNAMICALLY EXTRACTED 
S VOCABULARY. Significant limitations of the Brodsky patent are that server based features are 
J" missing and only single closed caption data is monitored from a specific geographic site, as 

M opposed to broad geographical and dispersed sites in the present application. As such, the 

ni 

W present design has features and benefits that are not in the Brodsky design. 

P""s 

SUMMARY OF THE INVENTION 


It is an object of the present invention to provide a substantially real-time, 
20 comprehensive, and cost effective means for the monitoring, decoding, transmission, filing and 
retrieval of television word content through the client server based processing of closed caption 
text. It is another object of the invention to provide such means that makes such text accessible 
to end users via the Internet or other communication networks. It is another principal object of 
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the present invention to provide automatic delivery of such search resultant text to non-current 
(i.e., prior inquiry) clients. It is a further object of the present invention to provide an automated, 
minimal cost, sales promotional tool to prospective customers using the method, system, and 
database of the present invention. Other objects of the present invention, as well as particular 
5 features, elements, and advantages thereof, will be elucidated in, or be apparent from, the 
following description and the accompanying drawing figures. 


The foregoing objects of the present invention are achieved by providing a system for the 
automatic collection and conditioning of closed caption texts originating from multiple 
1QQ geographic locations, comprising: (1) at least one remote capture client means having a tuner to 
^ receive one or more television signals, a decoder to decode closed caption text stream in the 
j*J television signals, and means to write the closed caption text stream to a text file, (2) central 
7* server means operatively connected to the remote capture client means for storing the text files 

Sj and making the text files available to a user, and (3) an inquiry client means operatively 

ni 

lSJ connected to the central server means for searching the text files. The central server is adapted to 
automatically process search requests from the inquiry client and notify the inquiry client via any 
suitable communications means, and particularly so when the inquiry client is not in active 
communication with the central server. 


20 In one preferred embodiment, the non-current inquiry client would receive an electronic 

message (i.e., e-mail) containing the substantially real-time search resultant texts matched to a 
prior existing same client search inquiry profile. 
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BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 is block diagram of a system for capturing, processing, and displaying closed 
caption text according to the present invention having a remote capture client, an inquiry client, 
5 and a central server; 

Figure 2 is a block diagram representing two methods of transferring text files from the 
remote capture client to the central server; 

1(P Figure 3 is a block diagram representing the operation of the remote capture client of Fig. 

is '5 

3 Figure 4 is a block diagram representing the operation of the inquiry client of Fig. 1. 

si 
fu 

M Figure 5 is a block diagram representing the operation of the central server of Fig. 1 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Referring to the drawings and, in particular, Fig. 1, there is illustrated a system according 
20 to the present invention having remote capture client means 100, inquiry client means 1 10, and a 
central server 130. 
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Remote capture client means 100 generates decoded closed caption text files. Remote 
capture client means 100 may be operated as a client or as a server {see Fig. 3). Remote capture 
client means 100 sends a text file 140 to central server means 130 through communication means 
120 using a client to server networking program 260. Communication means 120 may be any 
5 suitable method of communication, including the internet, a local area network (LAN), and/or a 
wide area network (WAN). Client to server networking program 260 can manage numerous text 
files arriving from a plurality of remote capture client means. 


Inquiry client means 110 is used to accesses central server means 130 for the purpose of 
lffl searching closed caption text files stored on central server means 130. Inquiries and results are 
transmitted between inquiry client means 110 and central server means 130 via communications 

Mi 

Yl means 120. The connection between inquiry client means 1 10 and central server means 130 may 

MS 

be user-initiated or "interactive." In the alternative, the connection may be automatically 
Sj initiated by central server means 130 at a pre-selected time or when results from a pre-specified 
tM search are available. 

Central server means 130 is connected to both remote capture client means 100 and 
inquiry client 1 10. Central server means 130 collects, conditions, and stores text files received 
from remote capture client means 100. In addition, central server means 130 receives inquiries 
20 from inquiry client 1 10, processes those inquires, and returns search results to inquiry client 1 10. 
Preferably, central server means 130 performs its tasks on a continuous basis, 24 hours a day, 7 
days a week. 
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Fig. 2 illustrates that remote capture client means 100 may be connected directly and/or 
indirectly to central server means 130. An indirect connection between remote capture client 
means 100 and central server means 130 passes through a remote capture server means 220. 


5 Remote capture client means 100 may be located geographically, strategically, 

economically, and/or conveniently. In many locations, remote capture client means 100 can 
satisfy all of the closed caption capturing requirements and, as such, a lower cost system may be 
employed wherein remote capture client means 100 communicates directly with central server 
means 130 through client to server networking 260 {see also Fig 1). In the alternative, remote 

Q 

ljjj capture client 100 may be employed together with a supplemental capture client means 100a. 

J3 

M= When supplemental capture client means 100a is used in the system, remote capture client means 
W 100 and supplemental capture client means 100a are preferably connected to remote capture 
^ server 220 via a suitable communications means, such as the internet, a LAN, and/or a WAN. 
Remote capture client means 100 and supplemental capture client means 100a send text files to 

nj 

ljjy remote capture server means 220 rather than directly to central server means 130. Remote 

; I 

U capture server means 220 collects text files from both remote capture client means 100 and 
supplemental capture client means 100a. Remote capture server means 220 communicates to 
central server means 130, as time, conditions, or other requirements are satisfied. For example, 
remote capture server means 220 may collect information from its assigned capture clients 100, 
20 100a during a twenty-four hour period and transfer the collected text files to central 

130 in a single communication session via server to server networking 240. Moreover, remote 
capture client means 100 may be adapted to directly or indirectly communicate with 
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supplemental capture client 100a for centralization, redundancy, load sharing, and/or cost saving 


reasons. 


In a preferred embodiment of the present invention, text files are transferred from remote 
capture client means 100 to central server means 130 across the internet using a protocol, such as 
File Transfer Protocol (FTP). An FTP connection may be controlled either by remote capture 
client means 100 or central server means 130. When an FTP connection is controlled by remote 
capture client means 100, the FTP connection is referred to as a "push" connection because the 
text files are transferred by an outgoing FTP connection. When an FTP connection is controlled 
by central server 130, the FTP connection is referred to as a "pull" connection because the text 
files are accepted from an incoming FTP connection. However, some servers reject incoming 
FTP connections. For example, a firewall restricts the ability of a server to "pull" data. 
Nonetheless, all servers can "push" data. Thus, it is preferable that the system according to the 
present invention be designed to "push" rather than "pull" text files. 


FTP sessions between remote capture client 110 and central server 130 may transfer 
entire directory structures. Optionally, the source text files may be deleted from local capture 
client 100 after the transfer is complete and the system has verified that the text files exists on 
central server 130. This is a true "move" operation. Of course, if the text files are not 
successfully transferred, the system will re-transfer the sources files during the next FTP session. 
The system may establish an FTP session between remote capture client 100 and central server 
130 automatically either (1) when a text file is created at remote capture client 100, (2) a specific 
time interval has elapsed (e.g. 60 minutes), and/or (3) a specific time has been requested (e.g. 2 J 
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AM). A preferred FTP API is available in Microsoft's Windows 2000. Optional features may 
be implemented in the present system, method, and database using an API wrapper. 


Remote capture client means processing is outlined by Fig. 3. Remote capture client 
5 means processing begins with one or more conventional signal sources 310, which may be 
received by a broadcast antenna, broadcast cable, video tape player, or any other video source. 
Convention signal source 310 is received by to a tuner 320. Tuner 320 is tuned to receive a 
desired frequency by hardware and software functions contained in a setup, scheduling, and 
programming means 360 via a graphical user interface 370. Preferably, the setup, scheduling 
ljg and programming means 360 manages tuner 320, as well as a decoder 330, and a text handler 
U 340. 

Tuned signals are passed from tuner 320 to decoder 330. Decoder 330 interprets and/or 
^\ decodes any closed caption text in the tuned signals. A preferred embodiment of decoder 330_ 

Lj£i has been developed by Medialink Worldwide, Inc. and is known as TeleCap™. Other suitable 

O 

M* decoding programs are commercially available and include, by way of example, Le Petit 
Decoder™ by SoftNi Corporation. 


Decoder 330 passes the interpreted closed caption text to text handler 340, which writes 
20 the closed caption text into a text file. Text handler 340 can insert useful information into the 
text file including broadcast related data, such as time, date, broadcast station identifiers, 
broadcast market identifiers, broadcast station city, program title, and program actors and/or 
participants. 
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Text output process 350 sends the text file from text handler 340 either to central server 
means 130 or a remote capture server means 220 {see Fig. 2). Preferably, text output process 
350 is initiated after a text file is closed. Closing a particular text file can be accomplished under 
5 user specified control or based upon system pre-set parameters governed by the application. For 
example file closing may occur automatically when the recorded program concludes, a specific 
time interval has elapsed (e.g. 60 minutes), or a specific time has been requested (e.g. 2 AM), as 
well as other opportunities as warranted. 

l[g Tuner 320, decoder 330, text handler 340, setup, scheduling, and programming means 

U 360, and graphical user interface 370 may be incorporated into a single hardware device. A 
number of commercially available computer hardware devices and internal computer boards 

: s 
?™= 

^ provide tuning and decoding capabilities. For example, a preferred embodiment of the present 

O 

invention makes use of a computer board manufactured by the PosTech Company of Madison, 

PS s 

= m 

ljjji Wisconsin. However, any closed captioning decoding device known in the art may be used as 
M= part of the present invention. 


Preferably, both remote capture client means 100 and central server means 130 can 
manage aspects of remote capture client means processing and operation, such as 1) 
20 initialization, 2) programming, 3) upload and download times, 4) error processing and 5) 
updating. 


10 


J3 


?5E 


# # 


MEDIA0003CIP 


Figure 4 outlines inquiry client means processing. A user employs a graphical user 
interface 410 to define a text inquiry definition 420. Text inquiry definition 420 is structured by 
a send command and control program 430 for submission to central server means 130 via an 
inquiry output program 440. Text inquiry definition 420 can be maintained on inquiry client 110 
by an inquiry filing and management program 490. This allows the user to review previous 
postings and to use the same inquiry or a modified inquiry in the next search interval. 

Text inquiry definition 420 may also be maintained as part of a detailed search profile. 
Such a search profile will include information about the text inquiry definition, such as key 
words, key phrases, selected stations, selected markets, when and/or if the text inquiry definition 
should be compared against the database of text files (e.g., every 24 hours for 30 days), and 
limits to the search results (e.g., the first 100 records found). A search profile will also include 
information about the user that submitted text inquiry definition 420, such as name, password, e- 


sj mail address, and limitations on access (e.g., 5 text inquiry definitions per month). 


M In preferred embodiments, users have the option to permanently save one or more 

profiles for later retrieval and use during an active user-initiated search or alternatively, in an 
automatic current user interactive search. Moreover, users have the option to share permanently 
saved profiles with other selected users. In addition, profiles must necessarily be permanently 

20 saved for some discrete time frame on central server means 130 as part of automated search 

routines, such as automatic search processing 580 (see Fig. 5). The search profiles can be saved 
at central server means 130 and/or inquiry client means 1 10. To conserve resources, if a given 
profile is not permanently saved, central server means 130 maintains the profile for a pre- 
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determined interval (e.g., one day) and then removes the profile. Preferably, removing profiles 
from central server means 130 is done when system demand is low. A preferred routine for 
removing profiles from central server means 130 uses a scheduled command, such as Microsoft 
SQL Server Agent, which is available as part of Microsoft SQL Server 6.5 and in newer 
versions. 

A return posting 460 from central server means 130 is processed by the receive command 
and control routine 470 to manage any interface exchange or conversion option required by the 
application. Return posting 460 may be in one or more forms, such as an interactive results 
return or an automatic notification. Search results returned to inquiry client 1 10 will be 
displayed at graphical user interface 490 as formatted by reporting options routine 480. For 
example, the search results can be displayed as a list of program citations, with each citation 
having a sub-listing of key sentences. In addition, graphical user interface 490 can display 
Internet-oriented pages, links, and/or buttons. Buttons on the results page allow a user to expand 
key sentences into stories and stories into whole newscasts. Program citations identify specific 
news programs that include the words being searched for. For example, a program can be 
identified by the date it was broadcast, the city (or market) from which the broadcast originated, 
the call letters or name of the station or program source originating the broadcast, the name or 
title of the program, and the time of day the program was broadcast. 

In a preferred form of the present invention, inquiry client 110 sends and receives search 
results and/or text files from central server 130 via a web browser. Numerous methods have 
been developed to provide information over the web in a user-friendly format. Moreover, 
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improvements to existing technology are being developed at a very rapid pace. The system, 
method, and database of the present invention may be adapted to utilize any current or future 
technology for web-based data transfer. For example, server-side scripting is a popular and 
effective environment for presenting data from a server to a user. Currently, the system, method, 
5 and database of the present invention may utilize server-side scripting in the form of one or more 
environments, such as ASP, ISAPI, VB.NET, ASP.NET. 


• 


Referring to Figure 5, central server means processing begins with the receipt of a text 
file 140 from remote capture client means 100. Text file 140 can be processed by a series of 
1(0 subroutines designed to enhance the integrity and uniformity of text files, as well as archive, 
^ index, search, and deliver text files. 

j. m a 

y[js 

J* Text file 140 may be processed by a conditioning routine 510 that is designed to impose 

Q 

S| format consistency across all text files received by central server means 130 regardless of their 

ru 

W origin. Text files from different networks, geographic areas, and locales may use non- 
^ conforming close captioning standards since several format standards are available for use. The 
recommended formatting standard is the standard established by the National Captioning 
Institute (NCI).. NCI standards, amoung other things, specify specific symbolic patterns to mark 
the beginning or end of a single news story within a news broadcast. However, many 
20 broadcasters do not follow NCI standards. Nonetheless, broadcasting stations sometimes specify 
which standards they actually follow. Some broadcasting stations do not specify what standards 
they follow, but patterns can often be detected, either by a human or by computer, that indicate 
specific symbols the broadcasting station may be using in place of the NCI standards. As such, 
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Conditioning routine 510 can apply the particular standards or patterns of a given broadcast 
station to text files captured from that broadcast station, so that the resulting output text more 
closely follows a single standard. Thus, the conditioning routine 510 allows text files that 
deviate from NCI standards to be converted to substantially conform to NCI standards. 
Converting non-conforming text is preferably accomplished by comparing each and every text 
file received by central server means 130 against a database of station-specific formatting 
standards. Conversion routines developed for specific stations are invoked to change a non- 
conforming format to NCI standard formatting. For the present invention, the preferred program 
for performing the comparison and conversion of text files has been developed by Medialink 
Worldwide, Inc. and is referred to herein as ccScrub™, which is indicated in Fig. 4 as reference 
numeral 511. 

A spelling check 512 may be used as part of conditioning routine 510 to impose spelling 
and abbreviation consistency to all sources. For example, the company name "AT&T" is spelled 
many different ways by different broadcasters (e.g., "A-T-and-T," "A T & T," and "A T and T"). 
An abbreviation subroutine in spelling check 412 would search for predefined variations of a 
company name and convert all variations into the standard abbreviation. 

Text file 140 may be processed by a parsing routine 420. Parsing routine 420 is designed 
to parse an entire newscast or television event into its unique story parts or segments. Parsing 
routine 420 can apply user-defined and/or default rules to determine the parsing points within a 
program. Preferably, parsing routine 420 first determines whether a text file contains useful 
data. If a broadcaster has failed to provide captions for a given newscast or broadcast, the 



14 


MEDIA0003CIP 

resulting text file will only contain time stamps. By counting the number of letter characters and 
the number of numeral characters in the file, a ratio can be calculated that may then be used to 
determine if a text file is worth processing. For example, if the letter to number ratio is below a 
user-defined variable, the file can be rejected for further parsing. If a text file is accepted for 
further parsing, markers are created within the text file for denoting the beginning and end of 
segments within text file 140. For example, a user-defined rule can set the end of a segment 
whenever a user-defined variable of time has elapsed without attendant captioning. A default 
rule could search for occurrences of the "»>" symbol, which is the NCI standard symbol for 
denoting the beginning of a new story within a broadcast. Another default rule could search for 
occurrences of multiple time stamps uninterrupted by text and mark the beginning and ending of 
such sequences. A certain number of uninterrupted time stamps may denote either the 
occurrence of a commercial break or other non-captioned segment of a newscast. Once the 
appropriate markers have been inserted in text file 140, the content between each marker can be 
written to a new, separate text file. Each parsed text file preferably includes program origination 
information, total running time, and other segment information to better identify unique 
segments within a program. The new text files are named (e.g., by sequential number) and saved 
(e.g., to a new directory named for the particular newscast being processed). Parsing routine 420 
continues until all identifiable segments of text file 140 have been written to separate parsed text 
files. When all the separate parsed text files have been written, the original text file 140 is 
preferably deleted from the disk. A preferred program for parsing text files has been developed 
by Medialink Worldwide, Inc. and is referred as ccSplit™. 
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Indexing routine 430 creates an index of the words contained in text file 140. The words 
in text file 140 are preferably indexed along with other information about text file 140, such as 
its time and place or origin. The preferred program for use in indexing routine 430 is the 
Microsoft™ Index Server (version 2.0) or the Microsoft ™ Content Indexing Services (version 
3.0). However, any appropriate indexing software may be used. 

Another element of indexing routine 430 is a search engine interface 431 that allows 
users to submit search profiles, as described above, to the index of words. Search engine 
interface 431 could allow searches using Boolean logic. The results data generated by searching 
the index of words against a submitted profile are written to a results database 460. Writing 
search results to results database 460 allows for the ordering and other manipulation of the 
results data. The preferred results database is a structured query language database, such as 
Microsoft™ SQL Server, although other databases may be used. y 

Managing routine 470 supports the need for file management, archiving, restoring, and 
backing-up, as well as satisfying the overall file integrity requirements of central server means 
130. Preferably, an appropriate graphical user interface (not shown) is employed for these 
purposes. System information, including text information, system operational parameters, and 
query postings can be appropriately managed, archived and compressed as needed for system 
calibration, redundancy and report management purposes. 

One particularly preferred embodiment includes automatic search processing routine 480. 
A web page interface from inquiry client means 1 10, for example, allows a user to initiate 
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automatic search processing routine 480. Automatic search processing routine 480 submits a 
search profile to the index of words, either at a pre-determined time, or time interval, or each 
time the index of words is updated. The results are written to database 460 in the same fashion 
as described above. If and when results are written to results database 460, the user is notified 
via a messaging system and/or device, such as an electronic mail message ("e-mail") or a mobile 
telephone. Notification may be initiated by central server means 130 in the form of an electronic 
mail message, a "pop-up window," and/or other suitable forms. In the alternative, notification 
may be initiated when inquiry client means 1 10 establishes communications with central server 
means 130, such as by a dial-up connection. The preferred embodiment of automatic search 
processing routine 480 is a program developed by Medialink Worldwide, Inc. called 
AutoAlert™. 

Optionally, central server means processing may include a document generating routine 
570 for generating documents in printer-friendly format rather than merely for display on a 
monitor. Selected text documents that are sent to inquiry client 1 10 by central server means 130 
to inquiry client means 1 10 are preferably formatted for an Internet-oriented graphical user 
interface (i.e., a web browser). A particularly preferred format is hypertext markup language 
(HTML). Yet, there are cases where a user may want to have selected text documents formatted 
specifically for printing. While it is often possible for a user to print a document initially 
formatted in HTML, the formatting of such a printed document is often undesirable. For 
example, there is no way to specify a section or sub-section header or footer or a specific 
location to start a new printed page in HTML, so a text file that spans more than one printed page 
would only have a citation at the beginning of the printed document. 
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Instead of HTML formatting, a user may elect to have a selected text document formatted 
directly into a printable and reader-friendly format, such as a Microsoft Word™ document, and 
downloaded to inquiry client means 1 10. Such a printable and reader-friendly document may 
5 contain useful information not included in an HTML formatted document. For example, each 
page of such a printable and reader-friendly document may have a page number, section headers, 
the search parameters used to find the document, and/or the time, day, and program from which 
the text in the documents was captured. Furthermore, each printable and reader-friendly 
document can be selectively limited in size (e.g., 250 Kbytes). Preferably, the useful information 
1© added to documents for a particular user can be pre-defined by the user and saved in a user- 
j 3 ^ profile, as discussed above. 

W 

™ Directly formatting text files into a printable and reader-friendly document is particularly 

%J useful and mutually enhances other sub-processes of the present system, such as AutoAlert™, 

fU 

W since the printable and reader-friendly document can be automatically e-mailed to a user. For 
^ example, a user may create and save a search profile on central server means 130, as described 
above. Using routines as described above in reference to central server system processing, a 
printable and reader-friendly document can be automatically generated and sent to the user with 
or without an accompanying report and/or summary. Preferably, the printable and reader- 
20 friendly document is automatically sent to the user utilizing server-side scripting, such as 

Microsoft's™ VB Script™, and/or mail protocols, such as SMTP. This automated search and 
delivery routine may be scheduled to occur at intervals pre-determined either by the user or 
center server means 130. One predetermined event to trigger automated search and delivery 
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routine 480 is when there is a broadcast event containing content that matches a prior search 
profile from inquiry client 110. A reader-friendly, printable document containing the search 
results and/or document is automatically e-mailed to inquiry client 1 10. A preferred means for 
scheduling a search and delivery routine is Microsoft Windows 2000 Task Scheduler, which is 
5 an included component of the Microsoft Windows 2000 operating system. In addition, to 
conserve resources on central server means 130, a supplemental central server means (not 
shown) may be dedicated to the automated search and delivery process. 


Central server means 130 may provide several methods for reviewing text files. The 
1Q3 system optionally provides for retrieval of the complete file from which a key sentence is drawn, 
M thus allowing the user to read in context the full text surrounding any found search term. 

Another system option provides a means for the user to compile only selected sentences and their 
^ citations to a report suitable for printing or otherwise preserving to a user's preferred format. 

3 

^ Yet another system option provides a means for the user to compile selected full text versions of 

Mj found search terms to a report suitable for printing or otherwise preserving to a user's preferred 

M format. 


Preferably, results from database 460 are displayed as a citation with associated phrases 
and/or short sentences that contain the keywords in the search profile. In other words, rather 
20 than returning the full text of entire broadcasts that contain the keywords, results can be initially 
returned as key sentences only. These key sentences, because they contain the keyword or words 
the user is looking for, provide an overview of the full text, allowing the user to quickly scan 
many broadcasts without having to look at the full text of each broadcast. The full text of any 
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broadcast or portion thereof is available to the user, for example, by clicking on an associated 
button and/or link. Thus, users may first see their keywords within the context of specific 
sentences with the option of "zooming out" to see the sentences in the context of a story, and the 
story in the context of an entire broadcast. This hierarchy (citation — > sentence story — » 
newscast) is an efficient way of displaying the results of a search that may return hundreds of 
found text files. 

Results from database 460 may be displayed to a user as keywords or a short summary 
that is linked to the full text file stored on central server means 130. Preferably, the link would 
contain embedded information that could be used by central server means 130 for management 
tasks, such as authentication, security, and/or user subscription. For example, ongoing automatic 
searches may be initiated using search inquires from keywords of interest to a selected business 
and/or interest group. Search results could be written to a database as described above followed 
by a targeted electronic mail message that is tailored for and addressed to the selected business 
and/or interest group. The targeted electronic mail message would contain a phrase or sentence 
from the search results linked to the full text file. By following the link, a user would access 
central server means 130 and view the full text file. Regarding user subscription, the foregoing 
automatic search and targeted messaging routine would be particularly useful in offering trial 
access to potential customers and/or users for searching limited by time and/or subject matter. A 
trial access satisfied potential customer could then become an ongoing subscriber by return 
e-mail or a web-based subscription. 
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The systems, methods, and databases of the present invention are broadly applicable. By 
way of example, a company that manufactures products may wish to know whether broadcast 
programs are discussing the company and/or its products, and whether such discussions are 
favorable and/or unfavorable. There is a real-time need for such information. The present 
invention permits a company to automatically access such information or to have such 
information automatically electronically transmitted to the company (e.g., via e-mail) for study 
or appropriate action. Likewise, a company may be able to automatically search and/or monitor 
authorized and unauthorized uses of its trademarks and trade names in broadcast programs and 
commercials, such as in tracking its own commercials and commercials by competitors, and to 
automatically receive related search results and reports. 

Another commercial aspect of the present invention is an improved mechanism for 
attracting prospective customers or users of the present invention. A search profile tailored to a 
prospective customer and/or similarly interested prospective customers may be formulated and 
entered into the system in a manner as previously described. The system will automatically and 
periodically contact (e.g., via e-mail) the prospective customer(s), and automatically provide 
search results in one or more formats (e.g., a summary report) for examination by the prospective 
customer(s). The prospective customer(s) may then subscribe via the Internet, e-mail, or 
otherwise. In this manner, the system, method, and database of the present invention provide a 
powerful sales promotional tool at minimal costs and with effectively no sales personnel. 

In the embodiments of the present invention described above, it will be recognized that 
individual elements and/or features thereof are not necessarily limited to a particular embodiment 
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but, where applicable, are interchangeable and can be used in any selected embodiment even 
though such may not be specifically shown. It will thus be seen that the objects set forth above, 
among those elucidated in, or made apparent from, the preceding description, are efficiently 
attained and, since certain changes may be made in the above construction without departing 
from the scope of the invention, it is intended that all matter contained in the above description 
or shown on the accompanying drawing figures shall be interpreted as illustrative only and not in 
a limiting sense. It is also to be understood that the following claims are intended to cover all of 
the generic and specific features of the invention herein described and all statements of the scope 
of the invention that, as a matter of language, might be said to fall there between. 


22 


